A New Damping Strategy of Levenberg-marquardt Algorithm for Multilayer Perceptrons
نویسندگان
چکیده
In this paper, a new adjustment to the damping parameter of the Levenberg-Marquardt algorithm is proposed to save training time and to reduce error oscillations. The damping parameter of the Levenberg-Marquardt algorithm switches between a gradient descent method and the Gauss-Newton method. It also affects training speed and induces error oscillations when a decay rate is fixed. Therefore, our damping strategy decreases the damping parameter with the inner product between weight vectors to make the Levenberg-Marquardt algorithm behave more like the Gauss-Newton method, and it increases the damping parameter with a diagonally dominant matrix to make the Levenberg-Marquardt algorithm act like a gradient descent method. We tested two simple classifications and a handwritten digit recognition for this work. Simulations showed that our method improved training speed and error oscillations were fewer than those of other algorithms.
منابع مشابه
Optimization in companion search spaces: the case of cross-entropy and the Levenberg-Marquardt algorithm
We present a new learning algorithm for the supervised training of multilayer perceptrons for classification that is significantly faster than any previously known method. Like existing methods, the algorithm assumes a multilayer perceptron with a normalized exponential (softmax) output trained under a cross-entropy criterion. However, this output-criteria pairing turns out to have poor propert...
متن کاملGas-non-Newtonian Liquid Flow Through Horizontal Pipe – Gas Holdup and Pressure Drop Prediction using Multilayer Perceptron
Prediction of the gas holdup and pressure drop in a horizontal pipe for gas-non-Newtonian liquid flow using Artificial Neural Networks (ANN) methodology have been reported in this paper from the data acquired from our earlier experiment. The ANN prediction is done using Multilayer Perceptrons (MLP) trained with three different algorithms, namely: Backpropagation (BP), Scaled Conjugate gradient ...
متن کاملDamping–undamping strategies for the Levenberg–Marquardt nonlinear least-squares method
The speed of the Levenberg–Marquardt ~LM! nonlinear iterative least-squares method depends upon the choice of damping strategy when the fitted parameters are highly correlated. Additive damping with small damping increments and large damping decrements permits LM to efficiently solve difficult problems, including those that otherwise cause stagnation. © 1997 American Institute of Physics. @S089...
متن کاملMulti-layer perceptrons with Levenberg- Marquardt training algorithm for suspended sediment concentration prediction and estimation
The prediction and estimation of suspended sediment concentration are investigated by using multi-layer perceptrons (MLP). The fastest MLP training algorithm, that is the Levenberg-Marquardt algorithm, is used for optimization of the network weights for data from two stations on the Tongue River in Montana, USA. The first part of the study deals with prediction and estimation of upstream and do...
متن کاملA new Levenberg-Marquardt approach based on Conjugate gradient structure for solving absolute value equations
In this paper, we present a new approach for solving absolute value equation (AVE) whichuse Levenberg-Marquardt method with conjugate subgradient structure. In conjugate subgradientmethods the new direction obtain by combining steepest descent direction and the previous di-rection which may not lead to good numerical results. Therefore, we replace the steepest descentdir...
متن کامل